NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Towards Few-shot Chemical Reaction Outcome Prediction

https://doi.org/10.1145/3746252.3761236

Shen, Yili; Tian, Yijun; Ju, Cheng-Wei; Wiest, Olaf; Zhang, Xiangliang (November 2025, ACM)

Accurate chemical reaction prediction is essential for drug discovery and synthetic planning. However, this task becomes particularly challenging in low-data scenarios, where novel reaction types lack sufficient training examples. To address this challenge, we propose FewRxn, a novel model-agnostic few-shot reaction prediction framework that enables rapid adaptation to unseen reaction types using only a few training samples. FewRxn integrates several key innovations, including segmentation masks for enhanced reactant representation, fingerprint embeddings for richer molecular context, and task-aware meta-learning for effective knowledge transfer. Through extensive evaluations, FewRxn achieves state-of-the-art accuracy in few-shot settings, significantly outperforming traditional fine-tuning methods. Additionally, our work provides insights into the impact of molecular representations on reaction knowledge transfer, demonstrating that knowledge captured under molecular graph-based formulation consistently outperforms those learned in forms of SMILES generation in few-shot learning.
more » « less
Free, publicly-accessible full text available November 10, 2026
SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?

https://doi.org/10.18653/v1/2025.acl-long.424

Zhuang, Haomin; Zhang, Yihua; Guo, Kehan; Jia, Jinghan; Liu, Gaowen; Liu, Sijia; Zhang, Xiangliang (July 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available July 1, 2026
Fair Online Influence Maximization

Wang, Xiangqi; Zhang, Shaokun; Aguilar_Escamilla, Jose E; Wu, Qingyun; Zhang, Xiangliang; Kang, Jian; Wang, Huazheng (June 2025, Transactions on machine learning research)

Fair influence maximization in networks has been actively studied to ensure equity in fields like viral marketing and public health. Existing studies often assume an offline setting, meaning that the learner identifies a set of seed nodes with known per-edge activation probabilities. In this paper, we study the problem of fair online influence maximization, i.e., without knowing the ground-truth activation probabilities. The learner in this problem aims to maximally propagate the information among demographic groups, while interactively selecting seed nodes and observing the activation feedback on the fly. We propose Fair Online Influence Maximization (FOIM) framework that can solve the online influence maximization problem under a wide range of fairness notions. Given a fairness notion, FOIM solves the problem with a combinatorial multi-armed bandit algorithm for balancing exploration-exploitation and an offline fair influence maximization oracle for seed nodes selection. FOIM enjoys sublinear regret when the fairness notion satisfies two mild conditions, i.e., monotonicity and bounded smoothness. Our analyses show that common fairness notions, including maximin fairness, diversity fairness, and welfare function, all satisfy the condition, and we prove the corresponding regret upper bounds under these notions. Extensive empirical evaluations on three real-world networks demonstrate the efficacy of our proposed framework.
more » « less
Free, publicly-accessible full text available June 28, 2026
Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis

https://doi.org/10.18653/v1/2025.findings-acl.1102

Lang, Yicheng; Guo, Kehan; Huang, Yue; Zhou, Yujun; Zhuang, Haomin; Yang, Tianyu; Su, Yao; Zhang, Xiangliang (July 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available July 1, 2026
Improving reaction prediction through chemically aware transfer learning

https://doi.org/10.1039/d4dd00412d

Keto, Angus; Guo, Taicheng; Gönnheimer, Nils; Zhang, Xiangliang; Krenske, Elizabeth H; Wiest, Olaf (May 2025, Digital Discovery)

Pretraining of NERF models on chemically related mechanisms significantly improves the performance compared to pretraining by larger, mechanistically dissimilar reaction datasets.
more » « less
Free, publicly-accessible full text available May 14, 2026
WildlifeLookup: A Chatbot Facilitating Wildlife Management with Accessible Data and Insights

https://doi.org/10.1145/3701551.3704121

Wang, Xiangqi; Yang, Tianyu; Rohr, Jason; Scheffers, Brett; Chawla, Nitesh; Zhang, Xiangliang (March 2025, ACM)

Free, publicly-accessible full text available March 10, 2026
Application of Large Language Models in Chemistry Reaction Data Extraction and Cleaning

https://doi.org/10.1145/3627673.3679874

Huang, Xiaobao; Surve, Mihir; Liu, Yuhan; Luo, Tengfei; Wiest, Olaf; Zhang, Xiangliang; Chawla, Nitesh V (October 2024, ACM)

Chemical reaction data has existed and still largely exists in unstructured forms. But curating such information into datasets suitable for tasks such as yield and reaction outcome prediction is impractical via manual curation and not possible to automate through programmatic means alone. Large language models (LLMs) have emerged as potent tools, showcasing remarkable capabilities in processing textual information and therefore could be extremely useful in automating this process. To address the challenge of unstructured data, we manually curated a dataset of structured chemical reaction data to fine-tune and evaluate LLMs. We propose a paradigm that leverages prompt-tuning, fine-tuning techniques, and a verifier to check the extracted information. We evaluate the capabilities of various LLMs, including LLAMA-2 and GPT models with different parameter counts, on the data extraction task. Our results show that prompt tuning of GPT-4 yields the best accuracy and evaluation results. Fine-tuning LLAMA-2 models with hundreds of samples does enable them and organize scientific material according to user-defined schemas better though. This workflow shows an adaptable approach for chemical reaction data extraction but also highlights the challenges associated with nuance in chemical information. We open-sourced our code at GitHub.
more » « less
Full Text Available
Are we Making Much Progress? Revisiting Chemical Reaction Yield Prediction from an Imbalanced Regression Perspective

https://doi.org/10.1145/3589335.3651470

Ma, Yihong; Huang, Xiaobao; Nan, Bozhao; Moniz, Nuno; Zhang, Xiangliang; Wiest, Olaf; Chawla, Nitesh V (May 2024, ACM)

Full Text Available
Data-Efficient, Chemistry-Aware Machine Learning Predictions of Diels–Alder Reaction Outcomes

https://doi.org/10.1021/jacs.4c03131

Keto, Angus; Guo, Taicheng; Underdue, Morgan; Stuyver, Thijs; Coley, Connor W; Zhang, Xiangliang; Krenske, Elizabeth H; Wiest, Olaf (June 2024, Journal of the American Chemical Society)

Full Text Available
A Property-Guided Diffusion Model For Generating Molecular Graphs

https://doi.org/10.1109/ICASSP48485.2024.10447350

Ma, Changsheng; Guo, Taicheng; Yang, Qiang; Chen, Xiuying; Gao, Xin; Liang, Shangsong; Chawla, Nitesh; Zhang, Xiangliang (April 2024, 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

Inverse molecular generation is an essential task for drug discovery, and generative models offer a very promising avenue, especially when diffusion models are used. Despite their great success, existing methods are inherently limited by the lack of a semantic latent space that can not be navigated and perform targeted exploration to generate molecules with desired properties. Here, we present a property-guided diffusion model for generating desired molecules, which incorporates a sophisticated diffusion process capturing intricate interactions of nodes and edges within molecular graphs and leverages a time-dependent molecular property classifier to integrate desired properties into the diffusion sampling process. Furthermore, we extend our model to a multi-property-guided paradigm. Experimental results underscore the competitiveness of our approach in molecular generation, highlighting its superiority in generating desired molecules without the need for additional optimization steps.
more » « less
Full Text Available

« Prev Next »

Search for: All records